CATGGTAGACGGCTGCCCGGAGGGACCACGCGTCTGAGACCGGCGATCGGACCGCCAAAACCATGGGTAGCAATCGGGGCCGCAAG 
GCCGGAGGGGGCTCTCAGGACTTCGGCGCGGGACTCAAGTACAACTCCCGGCTAGAGAACATGAATGGCTTTGAGGAGGGTGTGGA 
GTTCCTGCCTGCGAACAATGCCAAGAAAGTGGAGAAGCGAGGCCCCAGGCGCTGGGTGGTGCTGGTGGCAGTGCTGTTCAGCTTCC 
TCTTGCTCTCCCTCATGGCTGGCTTGCTGGTGTGGCACTTCCATTATCGGAATGTGCGGGTTCAAAAAGTCTTCAATGGCCATCTG 
AGGATCACAAATGAGATCTTTCTGGATGCGTATGAGAACTCCACCTCCACAGAGTTTATCAGCCTGGCCAGCCAGGTGAAGGAGGC 
GCTGAAGCTGCTGTACAATGAAGTCCCTGTCCTGGGTCCCTACCACAAGAAGTCGGCTGTAACTGCCTTCAGTGAGGGCAGTGTCA 
TCGCCTACTACTGGTCAGAGTTCAGCATCCCCCCACACCTGGCAGAAGAGGTTGATCGCGCCATGGCTGTGGAGCGAGTTGTAACA 
TTGCCACCCCGAGCACGGGCACTGAAATCCTTCGTGCTAACATCTGTGGTGGCCTTCCCCATTGACCCCAGAATGCTGCAGAGGAC 
TCAGGACAACAGCTGCAGTTTTGCCCTGCATGCCCATGGTGCAGCAGTGACACGCTTCACTACCCCTGGCTTCCCCAACAGTCCCT 
ACCCGGCGCATGCCCGCTGCCAGTGGGTCCTGCGGGGGGACGCCGACTCTGTGCTGAGCCTCACCTTCCGAAGCTTTGATGTCGCT 
CCCTGTGATGAGCATGGCAGTGACCTGGTCACCGTGTATGATAGCCTGAGCCCCATGGAACCCCACGCTGTGGTGCGGCTGTGTGG 
CACCTTCTCACCCTCCTACAACCTGACTTTCCTCTCCTCCCAGAACGTCTTCCTTGTCACGCTGATAACCAATACTGACCGGCGAC 
ATCCTGGCTTTGAGGCCACTTTCTTCCAGCTGCCCAAGATGAGCAGCTGTGGCGGCTTTTTGAGTGACACCCAAGGGACATTTAGC 
AGCCCCTACTATCCAGGCCACTACCCGCCCAACATCAACTGCACATGGAATATCAAGGTGCCCAACAACCGGAACGTGAAGGTGCG 
CTTCAAACTCTTCTATCTGGTGGACCCCAACGTACCAGTGGGCTCCTGCACCAAGGACTATGTGGAGATCAACGGGGAGAAGTACT 
GCGGTGAGAGGTCCCAGTTTGTGGTGAGCAGCAACAGCAGCAAGATTACAGTCCACTTCCATTCTGATCACTCGTACACGGACACC 
GGGTTCCTAGCTGAGTACCTCTCCTACGACTCCAACGACCCGTGCCCAGGGATGTTCATGTGCAAGACTGGACGGTGCATCCGAAA 
GGAACTGCGCTGCGACGGCTGGGCAGACTGCCCGGATTATAGTGATGAGCGTTACTGCCGATGCAATGCCACCCACCAGTTCACGT 
GCAAAAACCAGTTCTGCAAGCCCCTCTTCTGGGTCTGTGACAGTGTCAACGACTGTGGGGACGGAAGTGACGAGGAGGGCTGCAGC 
TGTCCTGCTGGGAGTTTCAAGTGTTCCAATGGGAAGTGTCTCCCTCAGAGCCAGAAGTGTAATGGGAAGGACAACTGTGGAGATGG 
GTCTGACGAGGCTTCATGTGACAGCGTGAATGTCGTCTCTTGCACCAAATATACCTACCGCTGCCAAAATGGCCTCTGTCTGAGCA 
AGGGCAACCCTGAGTGTGATGGGAAGACGGACTGTAGCGATGGCTCCGATGAGAAAAACTGTGACTGTGGGCTGCGATCCTTTACC 
AAACAGGCTCGCGTGGTTGGTGGCACGAATGCGGACGAGGGCGAGTGGCCCTGGCAGGTGAGCCTCCACGCCCTGGGCCAGGGCCA 
CTTGTGTGGGGCCTCGCTCATCTCTCCTGACTGGCTGGTCTCTGCAGCTCATTGCTTTCAGGATGACAAAAATTTCAAGTACTCAG 
ACTACACGATGTGGACGGCCTTCCTGGGTCTGCTGGACCAGAGCAAGCGCAGTGCCTCTGGGGTGCAGGAGCTGAAGCTCAAACGT 
ATCATCACCCACCCTTCCTTCAATGATTTCACCTTCGACTATGACATCGCCTTGCTGGAGCTGGAGAAGTCGGTGGAGTACAGCAC 
CGTCGTGCGCCCCATCTGCCTGCCTGATGCTACCCATGTCTTCCCTGCTGGCAAGGCCATCTGGGTCACAGGCTGGGGGCACACAA 
AAGAGGGAGGTACCGGAGCGCTGATCCTGCAGAAGGGTGAGATCCGTGTCATCAACCAGACCACCTGTGAGGACCTCATGCCGCAG 
CAGATCACCCCACGAATGATGTGTGTGGGTTTCCTCAGTGGGGGTGTGGACTCCTGCCAGGGTGACTCTGGTGGCCCCTTGTCAAG 
CGCGGAGAAAGATGGGCGAATGTTCCAGGCTGGTGTGGTGAGCTGGGGTGAAGGCTGCGCTCAGAGGAACAAGCCAGGCGTGTACA 
CAAGGCTCCCTGTAGTTCGGGACTGGATCAAAGAGCACACTGGGGTATAGCAGCATGGACAGACAGCCGACCACAAACACCCACAG 
GGATGCCCGACATGCACACCTGGATACAGGAGAGGAACACTGACGACATTTATGCTGTGGCCTCCCCCCCCCAACACAACCCAGAC 
TGTGAACTGCATCCTTAGGACTCAGAGTTCTTCCAAAGTGGGACCCCCTCAAGAGTTGGAGAGAGAACTTGCGTGCTAGCGGCCCA 
GCCTGGGGGCAAGGGTTTGATGGCAGCCTTCCCCCTCTAGCCCTGAGCTGGGTGAAGATGATGCTGTCCCGGAGAGCTGCTTCCAA 
CTGTCATTGAGCTCCCGGGAGCCCTATGGGAGGAGGGGCTCAGGGTCACTCTTTTCAGGAAGCGCCAGCCCTAGGAACCCCAGAAA 
AGAGTGGTACCTAAGGCTGAAATTGTTTTGCTGTTGCCAGGGGTGGGTATTTGAGAGTAAAACATTTTATTTCTTTTTAAAAAAAA 
AAAAAAAAAA (SEQ ID N0:1) 

MGSNRGRKAGGGSQDFGAGLKYNSRLENMNGFEEGVEFLPANNA 

KKVEKRG PRRWWL VAVL F S FLLL SLMAGLLVWHFH YRNVRVQKVFNGHLR I TNE I FL 
DAYENSTSTEFISLASQVKEALKLLYNEVPVLGPYHKKSAVTAFSEGSVIAYYWSEFS 
IPPHLAEEVDRAMAVERWTLPPRARALKSFVLTSWAFPIDPRMLQRTQDNSCSFAL 
HAHGAAVTRFTTPGFPNSPYPAHARCQWVLRGDADSVLSLTFRSFDVAPCDEHGSDLV 
TVYDSLSPMEPHAWRLCGTFSPSYNLTFLSSQNVFLVTLITNTDRRHPGFEATFFQL 
PKMSSCGGFLSDTQGTFSSPYYPGHYPPNINCTWNIKVPNNRNVKVRFKLFYLVDPNV 
PVGSCTKDYVEINGEKYCGERSQFWSSNSSKITVHFHSDHSYTDTGFLAEYLSYDSN 
DPCPGMFMCKTGRCIRKELRCDGWADCPDYSDERYCRCNATHQFTCKNQFCKPLFWVC 
DSVNDCGDGSDEEGCSCPAGSFKCSNGKCLPQSQKCNGKDNCGDGSDEASCDSVNWS 
CTKYTYRCQNGLCLSKGNPECDGKTDCSDGSDEKNCDCGLRSFTKQARWGGTNADEG 
EWPWQVSLHALGQGHLCGASLISPDWLVSAAHCFQDDKNFKYSDYTMWTAFLGLLDQS 
KRSASGVQELKLKRIITHPSFNDFTFDYDIALLELEKSVEYSTWRPICLPDATHVFP 
AGKAIWVTGWGHTKEGGTGALILQKGEIRVINQTTCEDLMPQQITPRMMCVGFLSGGV 

DSCQGDSGGPLSSAEKDGRMFQAGWSWGEGCAQRNKPGVYTRLPWRDWIKEHTGV 

(SEQ ID NO: 2) 



FIGURE 1 
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underlined = deleted in targeting construct 

[ ] = sequence flanking Neo insert in targeting construct 



C ATGGT AG AC GGCTGC C C GGAGGG AC C AC GC GTC TG AG AC C GGC GATC GG AC C GC C AAAA 
CCATGGGTAGCAATCGGGGCCGCAAGGCCGGAGGGGGCTCTCAGGACTTCGGCGCGGGAC 
TCAAGTACAACTCCCGGCTAGAGAACATGAATGGCTTTGAGGAGGGTGTGGAGTTCCTGC 
C TGC GAAC AATGC C AAGAAAGTGGAGAAGCGAGGC CC C AGGC GCTGGGTGGTGC TGGTGG 
CAGTGCTGTTCAGCTTCCTCTTGCTCTCCCTCATGGCTGGCTTGCTGGTGTGGCACTTCC 
ATT ATC GGAATGTGC GGGTTC AAAAAGTC TTC AATGGC CATC TG AGG ATC AC AAATG AG A 
TC TTTC TGG ATGC GT ATGAG AAC TC C AC C TCC AC AG AGTTT ATC AGC C TGGC C AGC C AGG 
TG AAGG AGGC GC TGAAGC TGCTGT AC AATG AAGTC C C TGTC C TGGGTC C C T AC C AC AAG A 
AGTCGGCTGTAACTGCCTTCAGTGAGGGCAGTGTCATCGCCTACTACTGGTCAGAGTTCA 
GCATCCCCCCACACCTGGCAGAAGAGGTTGATCGCGCCATGGCTGTGGAGCGAGTTGTAA 
CATTGCCACCCCGAGCACGGGCACTGAAATCCTTCGTGCTAACATCTGTGGTGGCCTTCC 
CCATTGACCCCAGAATGCTGCAGAGGACTCAGGACAACAGCTGCAGTTTTGCCCTGCATG 
CCCATGGTGCAGCAGTGACACGCTTCACTACCCCTGGCTTCCCCAACAGTCCCTACCCGG 
CGC ATGCCC GCTGC C AGTGGGTC C TGC GGGGGGAC GC C GAC TCTGTGC TGAGC C TC AC C T 
TCCGAAGCTTTGATGTCGCTCCCTGTGATGAGCATGGCAGTGACCTGGTCACCGTGTATG 
ATAGCCTGAGCCCCATGGAACCCCACGCTGTGGTGCGGCTGTGTGGCACCTTCTCACCCT 
CCTACAACCTGACTTTCCTCTCCTCCCAGAACGTCTTCCTTGTCACGCTGATAACCAATA 
CTGACCGGCGACATCCTGGCTTTGAGGCCACTTTCTTCCAGCTGCCCAAGATGAGCAGCT 
GTGGCGGCTTTTTGAGTGACACCCAAGGGACATTTAGCAGCCCCTACTATCCAGGCCACT 
ACCCGCCCAACATCAACTGCACATGGAATATCAAGGTGCCCAACAACCGGAACGTGAAGG 
TGC GCTTC AAACTC TTC TATC TGGTGGAC C CC AAC GTAC C AGTGGGC TC CTGC AC C AAGG 
ACTATGTGGAGATCAACGGGGAGAAGTACTGCGGTGAGAGGTCCCAGTTTGTGGTGAGCA 
GCAACAGCAGCAAGATTACAGTCCACTTCCATTCTGATCACTCGTACACGGACACCGGGT 
TCCTAGCTGAGTACCTCTCCTACGACTCCAACGACCCGTGCCCAGGGATGTTCATGTGCA 
AGACTGGACGGTGC ATC CGAAAGGAAC TGC GCTGC GAC GGC TGGGCAGAC TGC CCGGATT 
ATAGTGATGAGCGTTACTGCCGATGCAATGCCACCCACCAGTTCACGTGCAAAAACCAGT 
TC TGC AAGCC C C TC TTC TGGGTC TGTGAC AGTGTC AAC GAC TGTGGGGAC GGAAGTGAC G 
AGGAGGGCTGCAGCTGTCCTGCTGGGAGTTTCAAGTGTTCCAATGGGAAGTGTCTCCCTC 
AGAGCC AG AAGTGT AATGGG AAGG AC AAC TGTGG AG ATGGGTC TG AC G AGGC TTC ATGTG 
ACAGCGTGAATGTCGTCTCTTGCACCAAATATACCTACCGCTGCCAAAATGGCCTCTGTC 
TGAGCAAGGGCAACCCTGAGTGTGATGGGAAGACGGACTGTAGCGATGGCTCCGATGAGA 
AAAACTGTGAC TGTGGGCTGC GATC C TTTACC AAAC AGGC TC GC GTGGTTGGTGGC AC GA 
ATGC GGAC GAGGGC GAGTGGC C C TGGC AGGTGAGC C TC C AC GCCC TGGGC C AGGGC C AC T 
TGTGTGGGGC C TC GC TC ATC TC TC C TGAC TGGC TGGTC TC TGC AGC TC ATTGC TTTC AGG 
ATGACAAAAATTTCAAGTACTCAGACTACACGATGTGGACGGCCTTCCTGGGTCTGCTGG 
ACC AG AGC AAGC GC AGTGC C TC TGGGGTGCAGGAGC TGAAGC TC AAAC GT ATC ATC ACC C 
ACCCTTCCTTCAATGATTTCACCTTCGACTATGACATCGCCTTGCTGGAGCTGGAGAAGT 
CGGTGGAGTACAGCACCGTCGTGCGCCCCATCTGCCTGCCTGATGCTACCCATGTCTTCC 
CTGCTGGCAAGGCC ATC TGGGTC AC AGGC TGGGGGC AC AC AAAAGAGGGAG [ GTACCGGA 
GCGCTGATCCTGCAGAAGGGTGAGATCCGTGTCATCAACCAGACCACCTGTGAGGACCTC 
ATGCCGCAGCAGATCACCCCACGAATGATGTGTGTGGGTTTCCTCAGTGGGGGTGTGGAC 
TCCTGC ] C AGGGTG AC TC TGGTGGC C C C TTGTC AAGCGC GG AG AAAG [ ATGGGC G AATGT 
TC C AGGCTGGTGTGGTGAGC TGGGGTGAAGGC TGC GCTC AGAGGAAC AAGC C AGGC GTGT 
ACACAAGGCTCCCTGTAGTTCGGGACTGGATCAAAGAGCACACTGGGGTATAGCAGCATG 
GACAGACAGCCGACCACAAACACCCACAGGGATGCCCGACATGCACACCTGGATACAGGA 
GAGGAACACTGACGACATTTATGCTGTGGCCTCCCCCCCCCAACACAACCCAGACTGTGA 
AC TGC ATC C TT AGGAC TC AG AGTTC TTC C AAAGTGGG AC C C C C TC AAG AGTTGG AG AGAG 
AAC TTGCGTGC TAGC GGCC C AGC C TGGGGGC AAGGGTTTGATGGC AGC C TTC CC C C TC TA 
GC C C TGAGC TGGGTG AAG ATGATGC TGTCC C GG AG AGC TGC TTC C AAC TGTC ATTG AGC T 
CCCGGGAGCCCTATGGGAGGAGGGGCTCAGGGTCACTCTTTTCAGGAAGCGCCAGCCCTA 
GGAACCCCAGAAAAGAGTGGTACCTAAGGCTGAAAT] TGTTTTGCTGTTGCCAGGGGTGG 
GTATTTGAGAGTAAAACATTTTATTTCTTTTTAAAAAAAAAAAAAAAAAA 
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Gene Sequence Structure 



2466 bp 



Sequence Deleted 



2505 bp 



Size of full-length 
cDNA: 3106 bp 




Targeting Vector* (genomic sequence) 
Construct Number: 2035 



Arm Length: 
5': 3.8 kb 
3': 1 kb 



o 



Neo 




Si 



Targeting Vector: 
Endogenous Locus 



p* Not drawn to scale 

SI 
O 



5 ' > TTCC C C ATTG AG AC TGGC TT A 

CCCCGGAAGCTGCCTGCCTCAGTC 

TCCCGCTTCCTGTCTCCCCAGGTA 

CCGGAGCGCTGATCCTGCAGAAGG 

GTGAGATCCGTGTCATCAACCAGA 

C C AC C TGTG AGG AC C TC ATGC C GC 

AGCAGATCACCCCACGAATGATGT 

GTGTGGGTTTCCTCAGTGGGGGTG 

TGGACTCCTGC<3 1 

(SEQ ID NO: 2) 



5 ■ > ATGGGC GAATGTTC C AGGC TG 
GTGTGGTGGGC TGGGGTGAAGGC T 
GCGCTCAGAGGAACAAGCCAGGCG 
TGTAC AC AAGGC TC C C TGT AGTTC 
GGGACTGGATCAAAGAGCACACTG 
GGGTATAGCAGCATGGACAGACAG 
C C GAC C AC AAAC AC C C AC AGGG AT 
GCCCGACATGCACACCTGGATACA 
GGAGAGGGAC A< 3 1 
(SEQ ID NO: 3) 
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